Leveraging Clinical Time-Series Data for Prediction: A Cautionary Tale

نویسندگان

  • Eli Sherman
  • Hitinder S. Gurm
  • Ulysses J. Balis
  • Scott R. Owens
  • Jenna Wiens
چکیده

In healthcare, patient risk stratification models are often learned using time-series data extracted from electronic health records. When extracting data for a clinical prediction task, several formulations exist, depending on how one chooses the time of prediction and the prediction horizon. In this paper, we show how the formulation can greatly impact both model performance and clinical utility. Leveraging a publicly available ICU dataset, we consider two clinical prediction tasks: in-hospital mortality, and hypokalemia. Through these case studies, we demonstrate the necessity of evaluating models using an outcome-independent reference point, since choosing the time of prediction relative to the event can result in unrealistic performance. Further, an outcome-independent scheme outperforms an outcome-dependent scheme on both tasks (In-Hospital Mortality AUROC .882 vs. .831; Serum Potassium: AUROC .829 vs. .740) when evaluated on test sets that mimic real-world use.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vehicle's velocity time series prediction using neural network

This paper presents the prediction of vehicle's velocity time series using neural networks. For this purpose, driving data is firstly collected in real world traffic conditions in the city of Tehran using advance vehicle location devices installed on private cars. A multi-layer perceptron network is then designed for driving time series forecasting. In addition, the results of this study are co...

متن کامل

Some New Methods for Prediction of Time Series by Wavelets

Extended Abstract. Forecasting is one of the most important purposes of time series analysis. For many years, classical methods were used for this aim. But these methods do not give good performance results for real time series due to non-linearity and non-stationarity of these data sets. On one hand, most of real world time series data display a time-varying second order structure. On th...

متن کامل

Ensemble Kernel Learning Model for Prediction of Time Series Based on the Support Vector Regression and Meta Heuristic Search

In this paper, a method for predicting time series is presented. Time series prediction is a process which predicted future system values based on information obtained from past and present data points. Time series prediction models are widely used in various fields of engineering, economics, etc. The main purpose of using different models for time series prediction is to make the forecast with...

متن کامل

Semiparametric Bootstrap Prediction Intervals in time Series

One of the main goals of studying the time series is estimation of prediction interval based on an observed sample path of the process. In recent years, different semiparametric bootstrap methods have been proposed to find the prediction intervals without any assumption of error distribution. In semiparametric bootstrap methods, a linear process is approximated by an autoregressive process. The...

متن کامل

Evaluation of SARIMA time series models in monthly streamflow estimation in Idanak hydrometry station

prediction of hydrological variables is a highly effective tool in water resource management. One of the important tools for modeling hydrological processes is the use of time series modeling and analysis. River series production series can be used by time series models in various studies such as drought, flood, reservoir systems design and many other purposes For this purpose, monthly flow dat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017